Czech-English Machine Translation Dictionary

نویسنده

  • Magdalena Prokopová
چکیده

We are proposing a format for translation dictionaries suitable for machine translation. The dictionary format is concise and generalizes phrases by introducing rules for morphological generation instead of using simple phrase to phrase mapping. We describe a simple way how to automatically construct our compact entries from a machine-readable dictionary originally intended for human users using parallel corpora. We further describe how to expand the compact dictionary entries to phrase table dictionary that can be used further on by machine translation systems (until the systems will support morphological generation from a translation dictionary natively). We performed manual annotation of a small set of entries to analyze problems of this approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prague Czech-English Dependency Treebank. Syntactically Annotated Resources for Machine Translation

This paper introduces the Prague Czech-English Dependency Treebank (PCEDT), a new Czech-English parallel resource suitable for experiments in structural machine translation. We describe the process of building the core parts of the resources – a bilingual syntactically annotated corpus and translation dictionaries. A part of the Penn Treebank has been translated into Czech, the dependency annot...

متن کامل

Czech-English Dependency-based Machine Translation

We present some preliminary results of a Czech-English translation system based on dependency trees. The fully automated process includes: morphological tagging, analytical and tectogrammatical parsing of Czech, tectogrammatical transfer based on lexical substitution using word-to-word translation dictionaries enhanced by the information from the English-Czech parallel corpus of WSJ, and a simp...

متن کامل

Czech-English Dependency Tree-based Machine Translation

We present some preliminary results of a Czech-English translation system based on dependency trees. The fully automated process includes: morphological tagging, analytical and tectogrammatical parsing of Czech, tectogrammatical transfer based on lexical substitution using word-to-word translation dictionaries enhanced by the information from the English-Czech parallel corpus of WSJ, and a simp...

متن کامل

An MT System Recycled

This paper describes an attempt to recycle parts of the Czech-to-Russian machine translation system (MT) in the new Czech-to-English MT system. The paper describes the overall architecture of the new system and the details of the modules which have been added. A special attention is paid to the problem of named entity recognition and to the method of automatic acquisition of lexico-syntactic in...

متن کامل

On A Device In Dictionary Operations In Machine Translation

A special programme converting classes of words of international usage directly from English to Czech is described in its application in an experiment of machine translation as well as in general environments. The words undergo special morphemic analysis, they are adapted morphemically and orthographically to the target language form and,in the experimental version, they are assigned pertinent ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007